Solution of the Density Classification Problem with Two Cellular Automata Rules 
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Recently, Land and Belew [Phys. Rev. Lett. 74, 5148 (1995)] have shown that no one-dimensional 
two-state cellular automaton which classifies binary strings according to their densities of l's and 
O's can be constructed. We show that a pair of elementary rules, namely the "traffic rule" 184 and 
the "majority rule" 232, performs the task perfectly. This solution employs the second order phase 
transition between the freely moving phase and the jammed phase occurring in rule 184. We present 
exact calculations of the order parameter in this transition using the method of preimage counting. 
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I. INTRODUCTION 

In recent years, cellular automata (CA) jl| have re- 
ceived considerable attention as models of natural sys- 
tems in which simple local interactions between compo- 
nents give rise to a complex global behavior. Such sys- 
tems have the ability of coordinated global information 
processing, often called "emergent computation," which 
could not be achieved by a single component. Since emer- 
gent computation occurs in many biological systems such 
as the brain, the immune system, or insect colonics, it is 
natural to ask how the evolution produces such complex 
information processing capabilities in ensembles of simple 
locally interacting elements. 

To model this process, genetic algorithms have been 
used to evolve cellular automata capable of performing 
specific computational tasks, in particular the so-called 
density classification task The CA performing this 
task should converge to a fixed point of all l's if the 
initial configuration contains more l's than O's, and to a 
fixed point of all O's if the initial configuration contains 
more O's than l's. This should happen within M time 
steps, where, in general, M can depend on the lattice size 
L (assuming periodic boundary conditions). 

The earliest proposed solution to this problem was the 
two-state radius-3 rule constructed by Gacs, Kurdyumov, 
and Levin (GKL) ||. According to this rule, if the state 
of a cell is 0, its new state is determined by a major- 
ity vote among itself, its left neighbor, and its second left 
neighbor. If the state of the cell is I, its new state is given 
by the majority vote among itself, its right neighbor, and 
its second right neighbor. It has been demonstrated that 
the GKL rule performs the density classification task only 
approximately, i.e., not all initial configurations are clas- 
sified correctly. In particular, when the initial density 
is close to 1/2, approximately 30% of the initial config- 
urations are misclassified. Attempts to evolve CA that 
perform density classification task resulted in rules com- 
parable to GKL in terms of proficiency, but not better, 
typically classifying correctly about 80% of all possible 
initial configurations [^J. In fact, it has been recently 
proved by Land and Belew [Q that the perfect two-state 
rule performing this task does not exist. 



If we think about the cellular automaton as a model 
of a multicellular organism composed of identical cells, 
or a single kind of "tissue," we could say that evolution 
reached a "dead end" here. In the biological evolution, 
when the single-tissue organism cannot be improved any 
further, the next step is the differentiation of cells, or 
aggregation of cells into organs adapted to perform a 
specific function. In a colony of insects, this can be in- 
terpreted as a "division of labor," when separate groups 
of insects perform different partial tasks. For cellular au- 
tomata, this could be realized as an "assembly line," with 
two (or more) different CA rules: the first rule is iterated 
t\ times, and then the resulting configuration is processed 
by another rule iterated t 2 times. Each rule plays the role 
of a separate "organ," thus we can expect that such a sys- 
tem will be able to perform complex computational tasks 
much better that just a single rule. In what follows, we 
will show that for the density classification problem this 
is indeed the case, and that the perfect performance can 
be achieved with just two elementary rules (184 and 232) 
arranged in the "assembly line" described above. 

II. RULE 184 

Let Q = {0, 1} be called a symbol set, and let S be 
the set of all bisequences over Q, where by a bisequence 
we mean a function on Z to Q. The set S will be called 
the configuration space. A block of length n is an ordered 
set &1&2 • • • b n , where n 6 N, hi £ Q. B n denotes the set 
of all blocks of length n. The number of elements of B n 
(denoted by card23„) equals 2". 

A mapping / : {0, l} 3 i— > {0,1} will be called an 
elementary cellular automaton rule. Alternatively, the 
function / can be considered as a mapping of B 3 into 
Bq = Q = {0, 1}. Corresponding to / (also called a local 
mapping) we define a global mapping F : S S such 
that [F(s)]i = /(sj_i, Si, s i+ i) for any s e 5". 

A block evolution operator corresponding to the local 
rule / is a mapping f : B n 1— > B n ~2 defined as 

f(b 1 b 2 ...b n )= (1) 
f(bi,b2,b 3 )f(b2,b 3 ,b 4 ) . . . /(b„_2, b n -i, b n ), 
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where n > 2. 

Rule 184 has been studied as a simplest model for the 
road traffic flow || . Its rule table 

000 -> 0, 001 -> 0, 010 -> 0, Oil -> 1, 
100 -> 1, 101 -> 1, 110 -> 0, 111 -> 1 

can be interpreted in terms of "cars" (ones) and "empty 
spaces" (zeros). "Cars" are moving to the right. If the 
"car" has an "empty space" in front of it, it will move 
there, i.e., it will move one unit to the right. Under this 
rule, the number of "cars" does not change, or in other 
words, the density of l's is conserved. 

In order to understand the dynamics of rule 184, let us 
define a preimage of a finite block a £ Bk as a block b £ 
Bk+2 such that f (b) = a. Similarly, a n-step preimage of 
the block a £ Bk is a block b £ Bk+2n such that f™(6) = a, 
where the superscript n denotes multiple composition of 
f, i.e., iterating f n-times. For a given elementary rule, 
the number of n-step preimages of a given block b (we 
will denote this number by card f~™ (&)) can be anything 
from to 2 2+2rl . For many rules, an exact expression for 
cardf~"(fo) can be found ||, and the "traffic" rule 184 is 
among such exactly solvable cases. 

For convenience, let us consider the preimages of the 
block 00 under fig4. We will first prove the following 
proposition: 

Proposition 1 The block S1S2 • ■ ■ S2n+2 is an n-step 
preimage of 00 under rule 184 if and only if s\ = 0, 

s 2 = and 2 + Yli=3 £( s i) > f or every 3 < k < 2n + 2, 
where £(0) = 1, £(1) = -1. 

We will present only a sketch of the proof here, based 
on the concept of "defects" 0-0. Since the dynam- 
ics of rule 184 can be viewed as "particles" or "defects" 
(blocks of two or more 0's or l's) propagating in the reg- 
ular background (periodic pattern of alternating 1 and 0, 
. . . 101010101 . . .), it will be useful to introduce a trans- 
formation eliminating the background. Let us consider 
a block of length 2n + 2, and let us check whether it is 
a preimage of 00 or not. To eliminate the background, 
we first identify all continuous clusters of at least two ze- 
ros. Each site in such a cluster is replaced by the symbol 
a, except the leftmost 0, which is replaced by *. Sim- 
ilarly, in every cluster of at least two ones, every 1 is 
replaced by the symbol /?, except the rightmost 1, which 
is replaced by *. All remaining sites are replaced by *. 
For example, the string 1010001011000010 will be trans- 
formed into * * * * act * */? * -kaaa * *. The dynamics of 
the rule can now be understood as a motion of blocks of 
a's and /3's in the background of Vs. Every time step, 
each block of a's moves one unit to the right, and each 
block of P's one unit to the left. When a and f3 blocks 
collide, each block decreases its length by one per time 
step, until one of them (or both) disappear. Let us now 
consider the block s\s 2 ■ ■ ■ S2n+2 iterated n times with 
rule 184. The state of the cell i at time t will be denoted 
by s\. Block s^s® ■ ■ ■ S2n+2 can be a preimage of 00 when 



s n s n+i = 00; or using our transformation, s"s^ +1 = *a. 
Since all blocks of a's are moving with constant speed, 
this means that at t = we must have s^s^ — * a or i 
using the original notation, s^s^ = 00, which means that 
a travels from i = 2toi = n + linrj time steps. It 
can travel this distance "safely," if and only if it does not 
collide with another /3-block. This can happen if all (3- 
blocks are annihilated before they hit our a, or in other 
words, if for every k such that 2 < k < 2n + 2 the num- 
ber of /3's in the subblock S3S4 . . . s° is smaller than the 
number of a's. Translating this back to the original no- 
tation (i.e., using 0's and l's) we obtain the conclusion 
of Proposition 1. 

The S3S4 . . . S2„+2 part of the preimage of 00 can there- 
fore be constructed by using the following algorithm: we 
start with an initial "capital" equal to 2. Every time 
we choose 0, our "capital" increases by 1, and when 
we choose 1, it decreases by 1. We have to find a 
path such that the "capital" never reaches zero - the 
problem known in the probability theory as the "gam- 
bler's ruin problem" [[11]]. Let us denote the "capital" 
at time t by c{t) = X)i=3?( s »)' an d let c(2) = 2. Our 
string S3S4 . . . S2m+2 can be now represented by a path 
c(2)c(3) . . . c(2m + 2). Geometrically, this can be viewed 
as a two-dimensional polygonal line joining points [t, c(t)] 
starting at (2, 2) and ending at (2m + 2, y) which neither 
touches nor crosses the horizontal axis. The number of 
all such paths can be computed usin g we ll known combi- 
natorial theorems (see, for example, fill) and is equal to 
N 2m ,y-2 - N 2m , y +2, where 

^(l (a a +b ))< W 

and Na^ = if b > a. 

In the path from (2, 2) to (2m + 2, y) we have (2m + y — 
2)/2 zeros and (2m— y+2)/2 ones. If we randomly choose 
l's with probability p and 0's with probability 1 — p, then 
the probability of selecting admissible path with a given 
"finite capital" y is 

(AWa - N 2m , y+ 2)p {2m - v+2),2 {l ~ p)^ + y^/ 2 (3) 

Taking into account the fact that y must be even and 
that the first two digits of the preimage must be 00, we 
conclude that the probability that a randomly selected 
string (if we select ones with probability p and zeros with 
probability 1—p) of length 2m + 2 is an m-step preimage 
of 00 is 

m+l 

P m (00) = (1 - pf ( N 2m,2k-2 - (4) 

fe=l 

~N 2m ,2k+2) P m - k+1 (l -p) m+k - 1 . 

Note that if we start from an infinite random initial con- 
figuration with the density of ones p, then the probability 
of the occurrence of the block 00 after m iterations of rule 
184 will also be given by P m (00). 
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Although eq. (0) is rather complicated, asymptotic ex- 
pansion for large t is possible. Using the Stirling formula 
to approximate binomial coefficients, after some algebra 
one obtains: 




nt 

[4p(l-p)]' 



otherwise, 



l-2p if p < i, 
otherwise. 



(5) 



(6) 



As we can see, (00) plays here a role of the order pa- 
rameter in a second order kinetic phase transition, with 
the control parameter p. The critical point is exactly at 
p = 1/2, and at the critical point P*(00) approaches its 
stationary value as t~ x l 2 . Away from the critical point, 
the approach is exponential, and it slows down as p comes 
closer to 1/2. For finite configurations (with periodic 
boundary condition) the performance of rule 184 in elim- 
inating 00 blocks for p > 1/2 is even better. 

Proposition 2 If the finite initial configuration consists 
of Nq zeros and N± ones, and Nq < N±, then after at 
most \_(Nq +Ni — 2)/2j time steps all 00 blocks disappear 
(\_x\ denotes the largest integer less or equal to x ). 

To see this, let us first consider Nq + N\ even, so that 
N% — No > 2. Let us further assume that after (No+N% — 
2)/2 time steps we still have at least one 00 block. This 
means that we can write our entire initial configuration 
as aia2 . . . onq+Nx satisfying hypothesis of Proposition 1, 

i.e., a\ — 0, a2 = and 2 + J2i=3^.( a i) > f° r everv k 
such that 3 < k < Nq+N\. Note, however, that if a\ = 0, 
a 2 = then 2 + Y^i^^ 1 £( a i) < 0, and since it contra- 
dicts the previous statement, a\a2 ■ ■ ■ apf a+ N 1 cannot be 
a preimage of 00. The proof for odd Nq + N\ is similar. 
Also, due to the self-duality of rule 184, the same theorem 
holds for the block 11 when N > N x . When N = N ly 
both 00 and 11 blocks disappear after (N + N x - 2)/2 
time steps, and the configuration becomes an alternating 
sequence of and 1, . . . 01010101 . . .. 

To summarize, we found that for a finite lattice of 
length L and the density p, after \_(L — 2)/2j iterations 
of rule 184 the resulting configuration 

• contains no 00 blocks if p > 1/2, 

• contains no 11 blocks if p < 1/2, 

• contains neither 00 nor 11 blocks if p = 1/2. 



III. RULE 232 

Rule 232, also called the "majority rule," has the fol- 
lowing rule table: 



000 
100 



0, 001 
0, 101 



0, 010 

1, 110 



0, 011 -> 1, 

1, 111 -> 1, 



which could be also written as 



majority(s* 



1 S i' S i+l> 



(7) 



Let us assume that the initial configuration includes no 11 
blocks, but at least one 00 block. It is easy to check that 
the only preimages of 11 under f 2 32 are 0110, 0111, 1011, 
1101, 1110, and 1111, and all of them contain at least one 
subblock 11. This means that if the initial configuration 
contains no 11 block, then all subsequent configurations 
contain no block 11 either. Consequently, all entries in 
the rule table which contain 11 (i.e., 110, 111, and 011) 
do not matter, and we can change them without affecting 
the dynamics. Assuming that they are mapped to zero 
we obtain a "simplified" rule 



000 -> 0, 001 
100 -> 0, 101 



0, 010 

1, 110 



0, 011 -> 0, 
0, 111 -> 0, 



which has the code number 32. The following property 
of rule 32 can be easily proved by induction: 

Proposition 3 The block b £ £>2n+i is the n-step preim- 
age of 1 if, and only if, b = 101 . . . 01, i.e., it is an alter- 
nating sequence of 1 's and 's starting with 1 and ending 
with 1. 

Now, if L is odd, the [(L — l)/2]-step preimage of 1 
has to have the length L, so it has to be the entire initial 
configuration. If the entire initial configuration does not 
have the form required by Proposition |3| it cannot be the 
[(L— l)/2]-step preimage of 1. Therefore, after \{L— 1)/2] 
iterations of rule 232 the system converges to a state of 
all zeros. For even L this happens after (L — 2)/2 itera- 
tions. Similarly, if the initial configuration includes no 00 
blocks, but at least one 11 block, the system converges 
to a state of all ones. If the initial configuration contains 
neither 00 nor 11, it stays in this state forever. 

Using Propositions 2 and 3, our final result follows im- 
mediately: 

Proposition 4 Let s be a configuration of length L and 
density p, and let n = \_(L — 2)/2\, m = \_{L — 1)/2J. 
Then -F232(^ 7 i84( s )) consists of only 0's if p < 1/2 and 
of only l'sif p> 1/2. If p = 1/2, F% 2 (F? M (s)) is an 
alternating sequence of and 1, i.e., . . . 01010101 . . .. 

As we showed, first n iterations of rule 184 eliminate all 
blocks 11 if p < 1/2 (or 00 if p > 1/2), and the subsequent 
m iterations of rule 232 produce homogeneous configura- 
tion of of all 0's (or all l's). Configurations with p = 1/2 
are also treated properly, i.e., their density remains con- 
served and the converge to . . . 01010101 . . .. Examples 
are shown in Figure 1. 
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FIG. 1. Spatiotemporal diagrams for the (184,232) pair for 
lattice size L = 100 and the initial density a) 0.52 b) 0.5 c) 
0.48. An example of two a defects colliding with the defect 
is shown in a). Both rules were iterated 50 time steps (open 
arrows indicate where the iteration of rule 232 starts). 



IV. REMARKS 

In conclusion, we have demonstrated that the density 
classification task can be performed perfectly in L time 
steps with two cellular automata rules, rule 184 used in 
the first L/2 steps and rule 232 used in the remaining 
time steps. The advantage of this "assembly line" pro- 
cessing over a single rule is evident, as the single rule can 
never be 100% successful in density classification. 

The existence of this perfect solution does not mean, 
of course, that the evolutionary process could (or could 
not) produce such a pair of rules. Therefore, it would 
be interesting to design a genetic algorithm experiment 
in which pairs of CA rules are evolved, and find out how 
easy (or difficult) it is to produce pairs of "cooperating" 
rules performing better than single rules evolved in earlier 
experiments. Since the exact solution exists, we may 
speculate that the average performance of a pair obtained 
in such an experiment should be significantly better. 

Although the solution proposed here performs the task 
in L time steps, it is straightforward to construct a faster 
algorithm, providing that we allow rules of larger radius. 
If / is a radius-1 rule, than /", the rule iterated n times, 
is itself a CA rule of radius n. Therefore, the pair (g, h), 
where g = /{ l 84 and h — /| l 32 , performs the classification 
task in L/n time steps, assuming that we iterate both g 
and h for L/2n time steps. 

Another interesting question is the possibility of con- 
structing a general algorithm to discriminate configura- 
tions according to an arbitrary critical density p c . One 
promising approach to this problem involves generalized 
traffic rules, for example rules with higher "speed limits" 
J5| , where the occupied site can move to the right by up 
to m units if the sites in front of it are empty. Rules of 
this type exhibit a phase transition at p c = l/(m + 1) 
similar to the phase transition in rule 184. Any config- 
uration with p = p c converges to the periodic state of 
isolated l's separated by blocks of m zeros. Blocks of 
zeros longer than m are a-type defects, propagating to 
the right, while blocks of l's longer than 1 are defects 
of /?-type, propagating to the left. As in rule 184, (3 
defects are eliminated when p < p c , and a defects dis- 
appear when p > p c . One can also construct an analog 
of rule 232 which grows a defects if (3 defects are not 
present and conversely, and such a pair of radius- m rules 
can perform the p c = l/(m -I- 1) classification task for 
any integer m > (details of this construction will be 
presented elsewhere). It is not clear, however, how to 
generalize this method for arbitrary p c . 
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